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We compared the sequence of an envelope protein 
gene fragment from 21 temporally distinct West 
Nile (WN) virus strains, isolated in nine African 
countries and in France. Alignment of nucleotide 
sequences defined two groups of viruses which 
diverged by up to 29%. The first group of subtypes 
is composed of nine WN strains from France and 
Africa. The Austral-Asian Kunjin virus was classified 
as a WN subtype in this first group. The second 
group includes 12 WN strains from Africa and 
Madagascar. Four strains harboured a 1 2 nucleo- 
tide in-frame deletion. The loss of the correspond- 
ing four amino acids resulted in the loss of the 
potential glycosylation site present in several WN 
strains. The distribution of virus subtypes into two 
lineages did not correlate with host preference or 
geographical origin. The isolation of closely related 
subtypes in distant countries is consistent with WN 
viruses being disseminated by migrating birds. 



West Nile (WN) virus is a flavivirus (family Flaviviridae) 
which shares antigenic properties with other members of the 
Japanese encephalitis QE) virus serogroup including JE, St Louis 
encephalitis (SLE), Murray Valley encephalitis (MVE) and 
Kunjin viruses (Monath & Heinz, 1996). WN virus is a 
mosquito-borne pathogen responsible for WN fever in humans 
(Hayes, 1988). Clinical symptoms associated with WN virus 
infection are most frequently those of a mild febrile illness, but 



fatal cases of acute meningoencephalitis and fulminant hepatitis 
have been reported (Monath & Heinz, 1996). WN virus is 
endemic in tropical areas, particularly India and Africa, where 
local proliferation of infected mosquitoes can lead to epizootics 
(Madagascar in 1982 and Senegal in 1988 and 1990) and 
sudden epidemics (Israel in 1950 and South Africa in 1974). 
Recent epidemics with high rates of patient mortality have 
been reported in Algeria (1995) and in Romania (Anon., 1996; 
Le Guenno et al, 1996) presenting WN virus as a potential 
emerging human-pathogenic virus. WN virus can infect a wide 
range of vertebrate species in nature (Hayes, 1988) and wild 
birds are believed to play an important role in the WN 
transmission cycle by disseminating the virus during migration 
(Hayes, 1988; Monath & Heinz, 1996). Because WN virus has 
a wide geographical distribution, regional variation was 
investigated from an early stage. Thus, Hammam et al. (1965) 
found that Indian and African WN virus isolates have different 
haemagglutination inhibition kinetics. Studies of cDNA/RNA 
heteroduplex restriction profiles and/or reactivity toward 
monoclonal antibodies (Besselaar & Blackburn, 1988; Mathiot 
et al, 1990) have identified several WN virus variants. More 
recently, Porter et al. (1993) sequenced NS3 protein gene 
fragments of seven African and one Indian strains and defined 
three categories on the basis of nucleotide sequence similarity. 

In this study, we report the classification of 21 WN strains 
(Table 1) isolated in Africa and France by direct nucleotide 
sequencing of a gene fragment obtained from genomic RNA 
by RT-PCR. Cytoplasmic RNA was extracted from virus- 
infected Aedes pseudoscutellaris AP61 cells using a previously 
described method (Deubel et al, 1993). Nucleic acid sequence 
analysis was performed on an envelope (E) gene fragment 
obtained by RT-PCR from positions 1318 to 1645 in the WN 
virus genome (Wengler et al, 1985). The choice was dictated 
by the variability of this region as demonstrated by com- 
parative amino acid sequence analysis in the flavivirus group. 
In particular, four amino acids at positions 154 to 157 in the E 
protein of Kunjin virus (the closest virus to WN virus in the 
flavivirus evolutionary tree) are absent from the published 
sequence of a Nigerian WN virus (Wengler et al, 1985). 
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Table 1 . Characteristics of the 21 West Nile strains analysed in this study 

WN strains were obtained from the WHO collaborating centres for reference and research on arboviruses 
(Pasteur Institutes in Paris and Dakar). Virus strains were recovered from and/or passaged on either suckling 
mouse brain (before 1990) or cell culture (after 1990). 



Strain 


Geographical origin 


Year of isolation 


Primary source of isolationt 


Nal047 


Kenya 


Unknown 


(M) Aedes alboihorax 


EntM63134 


Uganda 


Unknown 


Unknown 


MP22 


Uganda 


Unknown 


Unknown 


HEglOl 


Egypt 


1951 


Human 


PaH651 




1965 


Human 


ArAlg/Djanet 


Algeria 


1968 


(M) Culex sp. 


ArB310 


Central African Republic 


1967 


(M) Culex sp. 


AnB3S07 


Central African Republic 


1972 


(B) Antichromus minutus 


ArB3573 


Central African Republic 


1972 


(M) Culex tigripes 


HB83P55 


Central African Republic 


1983 


Human 


HB6343 


Central African Republic 


1989 


Human 


AnMg798 


Madagascar 


1978 


(B) Coracopsis vasa 


ArMg956 


Madagascar 


1986 


(M) Culex quinquefasciatus 


ArMg978 


Madagascar 


1988 


(M) Culex univittalus 


AnD27875 


Senegal 


1979 


(P) Galago senegnlensis 


ArD76104* 


Senegal 


1990 


(M) Mimomyia lacustris 


ArD 76986 


Senegal 


1990 


(M) Culex poicilipes 


ArD78016* 




1990 


(M) Aedes vexans 


ArD93S48 


Senegal 


1993 


(M) Culex neavei 


ArA3212 


Ivory Coast 


1981 


(M) Culex guiarti 



* Sequences of the two strains were confirmed using cDNA products obtained by direct amplification of virus 

RNA in mosquito lysate pools. 

t Mosquito (M), bird (B) or primate (P) species. 



Moreover, preliminary studies of WN E protein by Western 
blotting suggested a variability in E protein glycosylation 
status similar to that observed among Kunjin isolates (V. 
Deubel, R. Hall & J. Mackenzie, unpublished results; Adams et 
al, 1995). 

cDNA was synthesized from WN virus RNAs using primer 
WN240 (5' GAGGTTCTTCAAACTCCAT 3') and amplified 
using primers WN240 and WN132 (5' GAAAACATCAA- 
GTATGAGG 3') as described previously (Deubel et al., 1993). 
Primers WN132 and WN240 correspond to highly conserved 
sequences in viruses of the JE virus serogroup. DNA amplicons 
were purified by ion exchange chromatography and pre- 
cipitated with 2 vols of isopropanol. Each amplicon (I pmol) 
was mixed with 1-0 pmol of primer (WN132 or WN240) and 
sequenced using the Taq Dye Deoxy Terminator cycle 
sequencing kit (Applied Biosystems). Nucleic acid sequences 
were obtained on an automated Applied Biosystems 3 73 A 
sequencer. WN virus sequences were aligned with each other 
using the multiple sequence alignment software CLUSTAL V 
(Higgins & Sharp, 1988). 

The limited sequencing of nucleotides 436 to 690 in the E 
gene indicated a generally uniform rate of random nucleotide 
mutations (data not shown). However, four WN strains, 
including the Nigerian isolate described by Wengler et al. 



(1985), a strain from Uganda of unknown origin and two 
strains from Senegal isolated in 1990, showed a deletion of 12 
nucleotides (nt 462 to 473). The sequenced regions of three of 
the strains were 100% identical. Each strain was separately 
cultured in cells and sequences of the two Senegalese strains 
were identified in RNA extracted from the original mosquito 
pool, which was preserved at — 70 °C, thus excluding the 
possibility of laboratory contamination (see Table I). The 
maximum nucleotide divergence in the sequenced region was 
29%. Nucleic acid changes occurred at 99 positions (32% of 
the gene fragment); 69% of them were in the third codon and 
85 % were silent. 

To visualize the range of relationships among WN viruses, 
a phylogram was constructed from the nucleotide sequences 
obtained in this study using the neighbour-joining method 
(Saitou & Nei, 1987). Because of its similarity to WN virus 
species, Kunjin virus was also included. The strains formed two 
distinct lineages (I and II) of closely related subtypes circulating 
in large and overlapping geographic areas (Fig. 1). The 
branches leading to these two lineages showed bootstrap 
values of 1000 and 954 when 1000 trees were sampled, 
indicating the robustness of the groupings. The lengths of the 
branches on the phylogenetic tree were proportional to 
evolutionary distance. Nucleotide sequences of virus subtypes 
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Fig. 1 . Unrooted tree presenting the extent of nucleotide sequence 
identity from the E gene fragment of 21 WN viruses and Kunjin virus (Coia 
et al., 1 988). Nucleotide sequences have been deposited in the GenBank 
database; SEN-ArD7801 6, AF001 556; CAR-HB83P55, AF001 557 ; 
CAR-HB6343, AF001 558; MAD-AnMg798, AF001 559; FRA-PaH651 , 
AF001 560; IC-ArA321 2, AF001 561 ; UGA-MP22, AF001 562; RCA- 
AnB3507, AF001 563; MAD-ArMg956, AF001 564; RCA-ArB3573, 
AF001565; CAR-ArB310, AF001 566; ALG-ArDjanet, AF001567; EGY- 
HEg101, AF001568; SEN-AnD27875, AF001569; SEN-ArD93548, 
AFO01 570; KEN-Na1047, AF001 571 ; AUS-Kunjin, AF001 572; UGA- 
Ent631 34, AF001 573; MAD-ArMg978, AF001 574. The geographical 
origin, strain number and year of isolation are given for each isolate in 
Table 1 . Asterisks indicate the four strains with 1 2 nucleotide deletions in 
the E gene fragment. The phylogenetic tree was constructed using the 
neighbour-joining method (Saitou <& Nei, 1 987) using the CLUSTAL V 
program (Higgins & Sharp, 1988). Bootstrap confidence limits between 
the two defined lineages I and II were calculated from 1 000 replicate 
trees. The horizontal bar indicates the distance corresponding to 10% of 
nucleotide changes. 



in one lineage differed by a maximum of 29% from those of 
subtypes in the second lineage. Within lineage I the maximum 
identity of WN strains was 87% and within lineage II it was 
80-5 %. Kunjin virus shared more than 80% nucleotide identity 
with WN viruses of lineage I and was consequently classified 
as a subtype in this lineage. Lineage I is composed of nine WN 
virus strains from France and North (Algeria, Egypt), West 
(Senegal, Ivory Coast) and Central (Central African Republic, 
CAR) Africa. Lineage II includes 12 WN virus strains from 
West (Senegal, Nigeria), Central (CAR) and East (Uganda, 
Kenya) Africa and Madagascar. Strains ArD 76986 and 
ArD93548 isolated in Senegal in 1990 and 1993, respectively, 
have the same E gene fragment sequence. In contrast, strains 
ArD76104 and ArD78016 isolated in Senegal in 1990 differ 
from ArD 76986, showing that very different subtypes circulate 
simultaneously in the same country. Three strains from 



Madagascar were isolated in different regions of the island and 
from different vectors and hosts. Strain AnMg798, isolated 
from a parrot, was classified in lineage II with strains ArMg978 
and ArMg956 isolated from Culex species 8 to 10 years later, 
but showed more than 19% nucleotide divergence. In lineage 
II, strains with a 12 nucleotide deletion in the E gene fragment 
(indicated by an asterisk in Fig. 1) showed less than 5% 
nucleotide divergence from subtypes from CAR. 

Amino acid differences were identified at 15 positions. 
Alignment of amino acids 146 to 230 of WN virus strains with 
that of the 1951 Egyptian HEglOl prototype WN strain 
revealed a maximum divergence of 13 % (Fig. 2). Amino acids 
154 to 157 were absent from strains ArD76104, ArD78016, 
EntM63134 and Nigeria. The sequence at these positions in 
the other strains was NYST, NYPT or SYST. The triad NYS 
forms an N-glycosylation site at position 154 in the E protein 
in JE, MVE and SLE viruses, and in some Kunjin viruses (Adams 
et al, 1995). This confirms the previous observation that the E 
protein of French 1965 WN isolates (V. Deubel, R. Hall & J. 
Mackenzie, unpublished results) and that of the Sarafend WN 
strain (Ng et al, 1994) are glycosylated. However, we cannot 
exclude the possibility that passaging may have influenced the 
glycosylation status of the E protein of WN strains studied as 
described for Kunjin virus (Adams et al, 1995). Direct 
sequencing of cDNA amplified from virus RNA in mosquito 
pools or in patient sera would confirm the polymorphism of 
the glycosylation site in natural infecting viruses. Clear 
signature motifs at amino acid positions A -> S 172, N -» S 
199, T -> S 205, T -> A 208 and T -> S 210 confirmed WN- 
Kunjin genotype distribution into two lineages. The cor- 
responding published sequence from Kunjin showed only 4 to 
11 amino acid changes (5 to 13 % divergence) with WN strains, 
whereas other viruses from the JE serogroup exhibited 24 to 38 
amino acid changes (28 to 45% divergence). This result 
confirmed the close relatedness of Kunjin virus and WN viruses 
(Coia et al, 1988). 

Little information about the genetic diversity of WN 
viruses was available before this study. Comparative sero- 
logical analysis of WN strains indicated a variability between 
isolates from different hosts and vectors and geographic areas 
but also showed that different strains circulated in the same 
area at the same time (Gaidamovich & Sokhey, 1973 ; Hammam 
et al, 1965; Hammam & Price, 1966; Mathiot et al, 1990; 
Odelola & Fabiyi, 1976; Price & O'Learly, 1967; Umrigar & 
Pavri, 1977). We previously demonstrated that the Egyptian 
prototype HEglOl and the Nigerian strains diverge by 22% in 
the nucleotide sequences corresponding to the C terminus of 
NS5 and to part of the 3' non coding region (Pierre et al, 1994). 
Porter et al (1993) compared nucleotide sequences at 182 
positions in the NS3 gene of eight WN strains from seven 
countries and reported closer relationships between the 
Nigerian and Ugandan strains (99'5 % similarity) and to a lesser 
extent with Malagasy strain (86% identity), than with strains 
from CAR, Ethiopia, Egypt and India which shared more than 
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EGY-HEgl01/51 

FRA-POH651/65 . 
ALG-ArAlg/68 

SEN-AnD27875/79 . 

SEN-ArD76986/90 . 

SEN-ArD93548/93 . 
CAR-HB6343/89 

CAR-ArB310/67 . 

IC-ArA3212/81 . 



SEN-ArD76104/90 . 
SEN-ArD78016/90 . 
UGA-EntM63134/? . 
NIG 

CAR-AnB3507/72 . 

CAR-HB83P55/83 . 
UGA-MP22/? 

MAD-ArMg956/86 . 

MAD-ArMg978/88 . 



PTTVESHGNYSTQIGATQAGRFSITPAAPSYTLKLGEYGEVTVDCEPRSGIDTNAYYVMTVGTKTFWVHREWFMDLNLPWSSAGS 



Fig. 2. Amino acid sequences deduced 
from the 255 nucleotides used to 
determine genetic relatedness among 21 
WN and one Kunjin isolates. Sequences 
of the Nigerian WN (Wengler ef al., 
1 985) , Kunjin (Coia ef al., 1 988) , JE, 
MVE and SLE (collected in Trent er al., 
e aligned. The first 



amino acid characterized in 


the 


WN E 


protein sequence (Wengler 


ef al 


1985) 


is indicated. Dots indicate c 






identities with the Egyptian 


HEg 


01 WN 


and dashes show missing c 




The potential N-glycosylatic 






indicated by an asterisk. 







92% identity but diverged by about 24% from the first 
grouping. Our database on the nucleotide sequence of an E 
gene fragment confirm this classification and emphasize the 
substantial genetic variability among African strains, which are 
unrelated to geographical region, time and host/vector. 

Comparative genetic analysis of other flaviviruses including 
dengue, yellow fever (YF), tick-borne encephalitis (TBE) and JE 
viruses showed a similar degree of variability within each 
species, although the classification of TBE and YF viruses 
correlates with geographic origin and corresponds to topo- 
types (Zanotto el al, 1996). However, dengue viruses are 
carried from one continent to the other by viraemic patients 
travelling by air. Like JE virus, WN virus uses a variety of hosts 
and vectors for its maintenance and propagation in nature. 
Wild birds were quickly recognized as important vertebrate 
hosts for WN virus and their migrations are assumed to be 
instrumental in virus dissemination (Taylor el al, 1956). Thus, 
similar subtypes found in France and Senegal and in Senegal 
and Central Africa suggest continent-to-continent and land-to- 
land movements. The involvement of birds in the repeated 
import of WN strains may explain the multiplicity of subtypes 
in Senegal and in CAR. The data presented in Fig. 1 strongly 
support the idea that these viruses are free to move between 
regions and organisms, with no obvious selection. 

Sequencing a short fragment of the E gene of WN viruses 
responsible for recent epidemics, and strains from South Africa 
and from other parts of the world (former Soviet Union, Israel, 
Pakistan and India) would give an overall picture of strain 
variability and virus circulation and may bring new insights 
into possible relationships with phenotypic variability. 
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We report the first full-length infectious clone of the current epidemic, lineage I, strain of West Nile virus 
(WNV). The full-length cDNA was constructed from reverse transcription-PCR products of viral RNA from an 
isolate collected during the year 2000 outbreak in New York City. It was cloned into plasmid pBR322 under the 
control of a T7 promoter and stably amplified in Escherichia coli HB101. RNA transcribed from the full-length 
cDNA clone was highly infectious upon transfection into BHK-21 cells, resulting in progeny virus with titers 
of 1 x 10 9 to 5 x 10 9 PFU/ml. The cDNA clone was engineered to contain three silent nucleotide changes to 
create a Styl site (C to A and A to G at nucleotides [nt] 8859 and 8862, respectively) and to knock out an £coRI 
site (A to G at nt 8880). These genetic markers were retained in the recovered progeny virus. Deletion of the 
3'-terminaI 199 nt of the cDNA transcript abolished the infectivity of the RNA. The plaque morphology, in vitro 
growth characteristics in mammalian and insect cells, and virulence in adult mice were indistinguishable for 
the parental and recombinant viruses. The stable infectious cDNA clone of the epidemic lineage I strain will 
provide a valuable experimental system to study the pathogenesis and replication of WNV. 



West Nile virus (WNV) is found in many regions, including 
Africa, the Middle East, Europe, Russia, India, Indonesia, and 
most recently North America (9). Phylogenetic analysis of 
WNV strains has revealed two distinct lineages (I and II). 
Lineage I strains are frequently involved in human and equine 
outbreaks, and lineage II strains are mostly maintained in 
enzootic cycles (4, 30, 35, 36, 59). Sequence analysis showed 
that the strain in North America is closely related to other 
human epidemic strains isolated from Israel, Romania, Russia, 
and France, all of which belong to lineage I (35, 36). WNV has 
caused significant human, equine, and avian disease since its 
appearance in North America in 1999 (2, 28, 36), and the virus 
has quickly spread from the Northeast to the eastern seaboard 
and to the Midwest (3). There were 61 human cases (7 deaths) 
in New York City in 1999 (13); 21 human cases (4 deaths) in 
New York, New Jersey, and Connecticut in 2000 (42); and 48 
human cases (5 deaths) in New York, Florida, New Jersey, 
Connecticut, Maryland, Massachusetts, Georgia, and Louisi- 
ana in 2001 (14). 

WNV is a member of the Flavivirus genus, a group of ar- 
thropod-borne viruses in the family Flaviviridae. Besides WNV, 
many other members of the flaviviruses are important human 
pathogens, including dengue virus (DEN), yellow fever virus 
(YF), the tick-borne encephalitis virus complex (TBE), Japa- 
nese encephalitis virus (JE), and Murray Valley encephalitis 
virus (MVE) (9). The flavivirus genome is a single plus-strand 
RNA of approximately 11 kb in length that encodes 10 viral 
proteins in a single open reading frame (55). The encoded 
polyprotein is translated and co- and posttranslationally pro- 
cessed by viral and cellular proteases into three structural 
proteins (the capsid protein C; the membrane protein M, 
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which is formed by furin-mediated cleavage of prM; and the 
envelope protein E) and seven nonstructural proteins (the 
glycoprotein NS1, NS2a, the protease cofactor NS2b, the pro- 
tease and helicase NS3, NS4a, NS4b, and the polymerase NS5) 
(15, 39). The 5' and 3' untranslated regions (UTRs) of the 
genomic RNA are approximately 100 and 400 to 700 nucleo- 
tides (nt) in length, respectively, and the terminal nucleotides 
of both the 5' and the 3' UTRs can form highly conserved 
secondary and tertiary structures (7, 8, 55, 60). 

The establishment of a reverse genetic system for the WNV 
strain presently circulating in the United States is a critical step 
in the study of the epidemic North American strains of WNV. 
Infectious full-length cDNA clones for a number of flaviviruses 
have been successfully developed for the study of viral repli- 
cation and pathogenesis (56). In several cases, assembly of 
full-length flavivirus clones in a plasmid vector was not 
straightforward because clones containing large portions of the 
genome were unstable and deleterious for bacterial hosts. This 
problem was first circumvented for YF by ligating cDNA frag- 
ments in vitro prior to RNA transcription (54). Similar ap- 
proaches were applied to develop infectious clones for JE (62), 
DEN type 2 (DEN2) (31), and TBE strain Hypr (40). For other 
flaviviruses, stable full-length infectious clones were estab- 
lished for DEN4 (34), Kunjin virus (33), TBE strain Neudoerfl 
(40), MVE (27), and TBE strain Langat (11). Although an 
infectious clone of the lineage II WNV strain from Nigeria was 
recently reported (68), no such full-length cDNA clone has 
been developed for the human epidemic lineage I WNV. 

In this report, we describe the construction of a stable full- 
length cDNA clone of a WNV strain (lineage I) isolated from 
the epicenter of New York City during the 2000 outbreak (18). 
RNA transcribed from the cDNA clone was highly infectious 
upon transfection into cells, as shown by expression of viral 
proteins, production of progeny virus, and high specific infec- 
tivity. Genetic markers engineered into the cDNA clone to 
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distinguish recombinant from parental viruses were retained in 
the recovered virus. The recombinant virus and the parental 
virus showed similar biological properties in terms of plaque 
morphology, growth kinetics, and virulence characteristics. 
The infectious clone of the present epidemic WNV strain in 
North America will serve as a valuable reverse genetic system 
to study the molecular mechanisms of WNV pathogenesis and 
replication. 

MATERIALS AND METHODS 
Cells and virus. Vero (ATCC CCL-81) cells were grown in minimal essential 
medium (MEM) supplemented with 10% fetal bovine serum (FBS). BHK-21/ 
WI2 (BHK-21) (64) andAedes albopictus C6/36 (C6/36) (ATCC CRL-1660) cells 
were grown in Dulbecco's modification of MEM with 10% FBS and 0.1 mM 
nonessential amino acids. Antibiotics were added to all media at 10 U/ml of 
penicillin and 10 u.g/ml of streptomycin. Cells were maintained in 5% C0 2 at 
37°C (Vero and BHK-21) or 28°C (C6/36). The parental WNV strain 3356 was 
isolated from the kidney of an American crow collected in October 2000 from 
Staten Island, New York (18). A single viral stock was made from the second 
passage in Vero cells without plaque purification, stored as aliquots at -80°C, 
and designated as parental WNV 3356. This stock was used as parental virus in 
all assays. Plaque assays were performed on Vero cells as described previously 
(53). 

cDNA synthesis and cloning. BHK-21 cells were infected at a multiplicity of 
infection (MOI) of 0.05 with parental WNV 3356, and virus was harvested from 
cell culture media at 36 h postinfection. Genomic RNA was extracted from the 
cell culture media with RNeasy (Qiagen, Valencia, Calif.). cDNA fragments 
covering the complete genome were synthesized from genomic RNA through 
ThermoScript reverse transcription (RT)-PCR according to the manufacturer's 
instructions (Gibco BRL, Rockville, Md.). Plasmid pBR322 was modified by 
replacement of the Sphl-EcoRl fragment in the tetracycline resistance gene with 
a pair of complementary oligonucleotides to create the sequence 5'-GCATGgA 
TCC CGTTGC GCATG CrGATTCGAACCG ACTAGT-CTCGAG-TCTAGA 
ATTC -3' to yield plasmid pBRlinker containing the unique restriction sites 
BamHl, Sph\, Spel, Xhol, and Xbal (listed in order and underlined). After the 
modification, the original Sphl site of pBR322 (italics) was mutated through a C 
to G substitution (lowercase italics). The modified pBR322, pBRlinker, was used 
as the cloning vector throughout the experiments. 

Bacterial strain HB101 (Gibco BRL) was used as the host for construction and 
propagation of cDNA clones. Standard cloning procedures were followed (57), 
except that constructs with inserts of greater than 3 kb were propagated at room 
temperature. Electroporation was performed to transfect plasmid into bacteria 
in 0.2-cm cuvettes, using a GenePulser apparatus (Bio-Rad, Hercules, Calif.) 
with settings of 2.5 kV, 25 u.F, and 200 SI. The virus-specific sequence of each 
intermediate cloning product was validated by sequence analysis (Applied Bio- 
systems, Foster City, Calif.) before it was used in a subsequent cloning step. All 
restriction endonucleases were purchased from New England Biolabs (Beverly, 

RNA transcription and translection. Plasmid pFLWNV, containing the full- 
length cDNA of WNV, was amplified in Escherichia coli HB101 and purified 
through MaxiPrep (Qiagen). For in vitro transcription, 5 u,g of pFLWNV was 
linearized with Xba\. Mung bean nuclease (5 U; New England BioLabs) was 
directly added to the Xbal digestion reaction mixture, and the reaction mixture 
was further incubated at 30°C for 30 min to remove the single-stranded nucle- 
otide overhang generated by the Xbal digestion. The linearized plasmids were 
extracted with phenol-chloroform twice, precipitated with ethanol, and resus- 
pended in 10 u.1 of RNase-free water at 0.5 u.g/u.1. The mMESSAGE m MA- 
CHINE kit (Ambion, Austin, Tex.) was used to in vitro transcribe RNA in a 
20-u.l reaction mixture with an additional 2 u.1 of GTP solution. The reaction 
mixture was incubated at 37°C for 2 h, followed by the addition of DNase I to 
remove the DNA template. RNA was precipitated with lithium chloride, washed 
with 70% ethanol, resuspended in RNase-free water, quantitated by spectropho- 
tometry, and stored at -80°C in aliquots. A mutant RNA transcript with a 
deletion of the 3'-terminal 199 nt of WNV was generated from pFLWNV 
linearized with an internal restriction site Dral at nt position 10830. The mutant 
RNA was synthesized in the same manner as the full-length RNA, as described 
above. All procedures were performed according to manufacturer protocols. 

For transfection, approximately 10 u.g of RNA was electroporated to 10 7 
BHK-21 cells in 0.8 ml of cold phosphate-buffered saline (PBS), pH 7.5, in 0.4-cm 
cuvettes with the GenePulser apparatus (Bio-Rad) at settings of 0.85 kV and 25 
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FIG. 1. Construction of the full-length cDNA clone of WNV. 
Genome organization and unique restriction sites as well as their 
nucleotide numbers are shown at the top. Four cDNA fragments 
represented by thick lines were synthesized from genomic RNA 
through RT-PCR to cover the complete WNV genome. Individual 
fragments were assembled to form the full-length cDNA clone of 
WNV (pFLWNV) as described in Materials and Methods. The com- 
plete WNV cDNA is positioned under the control of T7 promoter 
elements for in vitro transcription. Three silent mutations (shown in 
lowercase) were engineered to create a Sty\ site (*) and to knock out 
an £coRI site (A) in the NS5 gene. The numbers are the nucleotide po- 
sitions based on the sequence from GenBank accession no. AF404756. 



|iF, pulsing three times, with no pulse controller. After a 10-min recovery, cells 
were mixed with media and incubated in a T-75 flask (5% C0 2 at 37°C) until 
cytopathic effects (CPE) were observed. Virus was harvested as tissue culture 
media, clarified by centrifugation at 10,000 X g, stored in aliquots at -80°C, and 
designated as recombinant WNV. Plaque assays were performed on Vero cells as 
described previously (51). 

Genetic marker analysis of the recombinant and parental virus. Genetic 
markers of Styl and EcoRI were engineered into the cDNA clone (Fig. 1) to 
distinguish recombinant progeny virus from the corresponding parental virus. 
Recombinant virus harvested from supernatant on day 3 posttransfection and 
parental virus were subjected to RNA extraction with RNeasy (Qiagen). A 
388-bp fragment including the genetic markers was amplified through RT-PCR 
from RNA extracted from either recombinant or parental virus with primers 
8706V (5'-CATGGCCATGACTGACACTACTC-3') and 9093C (5'-CTTGGC 
CTTTCCGAACTCTCCG-3'). The RT-PCR products were digested with Styl or 
£coRI and analyzed on a 2% agarose gel. 

IFA. Indirect immunofluorescence assays (IFA) were used to detect viral 
protein expression in WNV RNA-transfected BHK-21 cells. After electropora- 
tion, approximately 10 s transfected cells were spotted onto 10-mm glass cover- 
slips. Cells on coverslips were analyzed by IFA at various times posttransfection 
for viral protein synthesis. Cells were fixed in 3.7% paraformaldehyde with PBS, 
pH 7.5, at room temperature for 30 min followed by incubation in -20°C 
methanol for 30 min. The fixed cells were washed with PBS, incubated at room 
temperature for 45 min in WNV immune mouse ascites fluid (1:100 dilution; 
ATCC, Manassas, Va.), and further reacted with goat anti-mouse immunoglob- 
ulin G conjugated with fluorescein isothiocyanate at room temperature for 30 
min (1:100 dilution) (KPL, Gaithersburg, Md.). The coverslips were washed with 
PBS, mounted to a slide using fluorescent mounting medium (KPL), and ob- 
served under a fluorescence microscope equipped with a video documentation 
system (Zeiss, Thornwood, N.Y.). 

Specific infectivity assay. Approximately 10 u,g of RNA was electroporated to 
10 7 BHK-21 cells, as described above. Both transfected and untransfected 
BHK-21 cells were adjusted to a concentration of 6 X 10 s cells per ml. A series 
of 1:10 dilutions were made by mixing 0.5 ml of transfected cells with 4.5 ml of 
untransfected cells. One milliliter of cells (6 X 10 s cells total) for each dilution 
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Primer" 


Primer sequence* 


Amplified fragment" 


IV* 


ATOTTC^^CTGG^GGTCCA 


BamHl-Sphl 


3839C 




3286V 


GTAGAGATTGACTTCGATTAC 


Sphl-Spel 


8804C 


CGTACTTCACTCCTTCTGGC 




8016V 


GCCCCAACTAGTGCAAAGTTATGGATGGAAC 


Spel-Xhol 


8881 C 


ATTCTTCTCGAGAGCACATcCTtGGACGTTTTTCTCTGGCC (Xhol. Stvl) 




8865V 


GTGCTCTCGAGAGGAgTTCATAAGA (Xhol. EcoKW) 


Xhol-Xbal 


11029C 


AacaatctagAGATCCTGTGTTCTCGCACCAC (Xbal) 





" The primers were named after the nucleotide position of viral sequence and polarity. V, viral genomic sense; C, complementary sense. Nucleotide numbering is 
based on the sequence from GeneBank accession no. AF404756. 

* Viral and nonviral sequences are in uppercase and lowercase, respectively. Restriction endonuclease sites are underlined and in parenthesis. Silent mutations within 
the viral sequences are depicted in lowercase. 

c cDNA fragments used to construct the full-length clone are shown in Fig. 1. 

d Italicized sequence represents the T7 promoter. 

' EcoRIA represents the knockout of an EcoRl site in the cDNA clone by a silent A to g substitution. 



was seeded per individual well of sbi-well plates. Triplicate wells were seeded for 
each cell dilution. The cells were allowed to attach to the plates for 4 to 5 h under 
5% C0 2 at 37°C before the first layer of agar was added, as described previously 
(53). After incubation of the plates for 3 days under 5% C0 2 at 37°C, a second 
layer of agar containing neutral red was added. Plaques were counted after 
incubation of the plates for another 12 to 24 h, and the specific infectivity was 
calculated as the number of PFU per microgram of RNA. 

Growth curves. Subconfluent BHK-21 and C6/36 cells in 12-well plates were 
inoculated with either the parental or recombinant WNV at an MOI of 5 or 0.05 
in triplicate wells. Virus stocks were diluted in BA-1 (M199-H [Gibco-BRL], 0.05 
M Tris, pH 7.6, 1% bovine serum albumin, 0.35 g of sodium bicarbonate/liter, 
100 U of penicillin/ml, 100 (ig of streptomycin/ml, and 1 p-g of amphotericin B 
[Fungizone]/ml). Attachment was allowed for 1 h under 5% C0 2 at 37"C or 
under 5% C0 2 at 28°C for the BHK-21 and C6/36 cells, respectively. The inocula 
were then removed, the monolayers were washed three times with BA-1, and 2 
ml of medium was added to each well. The plates were incubated for up to 6 days 
under 5% C0 2 at 37°C or under 5% C0 2 at 28°C for the BHK-21 and C6/36 cells, 
respectively. The medium was sampled immediately after the addition of me- 
dium (1-h time point) and at 7.5, 16, 24, 32, 40, 48 and 72 h for BHK-21 and 
C6/36 cells, as well as at 96 and 124 h for C6/36 cells. The 10-u.l samples were 
stored at -80°C prior to titration as previously described (53). Cells were ob- 
served daily for CPE. 

Virulence in mice. Mice were housed in an environmentally controlled room 
under biosafety level 3 conditions and were given food and water ad libitum. 
Female outbred CD-I mice (Charles River Laboratories, Wilmington, Mass.) 
were obtained at 5 weeks of age and were acclimatized for 1 week. All mice were 
6 weeks of age at the start of the experiment. Eight mice per group were 
inoculated with diluent alone or with 10 2 PFU of parental or recombinant virus 
subcutaneously (s.c.) in the left rear footpad. Diluent was PBS (endotoxin-free) 
supplemented with 1 % FBS. Mice were evaluated clinically and weighed daily for 
2 weeks, then monitored daily and weighed thrice weekly for 2 more weeks. 
Observed clinical signs included ruffled fur, paresis, hindleg paralysis, and trem- 
ors. Morbidity was defined as exhibition of greater than 10% weight loss or 
clinical signs for 2 or more days. Mice were euthanized if they became moribund. 
Exposure to virus was confirmed in all surviving mice by a positive antibody titer 
to WNV by enzyme-linked immunosorbent assay on day 28 postinoculation. 

Statistical analyses. Microsoft Excel 97 was used for all statistical analyses. A 
chi-square test was used to compare the morbidity and mortality in mice for the 
parental and recombinant viruses, and a two-tailed Student's ( test was used to 
evaluate the survival time for the two groups. 

RESULTS 

Construction and sequencing of the full-length WNV cDNA 
clone. Selection of the appropriate plasmid vectors and host 
bacterial strains is critical for construction of flavivirus full- 
length cDNA clones (56). After testing different vectors and 
bacterial hosts, plasmid pBR322 and E. coli HB101 were cho- 
sen for cloning throughout the study. A polylinker containing 
a number of unique restriction sites was engineered into 



pBR322 to facilitate the cloning procedure (see Materials and 
Methods). Cloning procedures for bacterial propagation had 
higher success rates when performed at room temperature 
than did those performed at 37°C. All constructs with inserts of 
greater than 3 kb were propagated at room temperature, but 
later experiments showed that propagation of the clones at 
room temperature was not necessarily essential once the clones 
had been constructed. 

Figure 1 shows the overall scheme of the cloning strategy. 
The primers used to generate individual fragments are listed in 
Table 1. The full-length cDNA clone was constructed in four 
steps. First, a fragment from Spel to Xhol was amplified by 
primers 8016V and 8881C and cloned into pBRlinker at their 
respective sites (Fig. 1), yielding plasmid pSpe-Xho. In order 
to distinguish the recombinant virus from the parental virus, a 
Styl site (underlined) was designed in primer 8881C containing 
silent mutations of nucleotide C to A and A to G at positions 
8859 and 8862 (underlined lowercase), respectively. Second, a 
fragment covering Xhol to Xbal was amplified by primers 
8865V and 11029C and inserted into plasmid pSpe-Xho to 
generate clone pSpe-Xba (Fig. 1). Primer 8865V contained an 
A to G substitution at nt 8880 (underlined lowercase) to knock 
out an EcoRl site within the sequence of the parental virus 
(EcoRIA in Fig. 1 and Table 1). Third, a fragment covering 
Sphl to Spel was amplified by primers 3286V to 8804C and 
cloned into pBRlinker at their respective sites, resulting in 
plasmid pSph-Spe. The fragment from Sphl to Spel was then 
subcloned into pSpe-Xba to yield pSph-Xba (Fig. 1). Finally, a 
cDNA fragment from BamHl to Sphl was amplified by primer 
IV and 3839C (Table 1) and cloned into plasmid pSph-Xba at 
the sites of BamHl and Sphl. Primer IV contained the T7 
promoter sequence (italicized lowercase) following the BamHl 
cloning site (underlined lowercase in Table 1). The resulting 
plasmid, pFLWNV, contained the complete WNV cDNA un- 
der the T7 promoter for in vitro transcription. For RNA syn- 
thesis, the full-length cDNA plasmid was linearized by Xbal 
and the overhang nucleotides resulting from Xbal digestion 
were removed by mung bean nuclease. RNA that was in vitro 
transcribed from this DNA template had an authentic 3' end of 
the WNV genomic RNA and a 5' nonviral G nucleotide de- 
rived from the T7 promoter. Analysis of the RNA transcript on 
a formaldehyde-denaturing agarose gel showed a single band 
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FIG. 2. Transcription of WNV RNA. Formaldehyde-denaturing 
1.0% agarose electrophoresis of RNA transcript together with 
genomic RNA purified from WNV. 



with a mobility identical to that of genomic RNA extracted 
from WNV (Fig. 2). 

Sequence analysis of the full-length cDNA clone showed 11 
nucleotide changes compared with the parental virus sequence 
(35) (Table 2). All nucleotide changes were silent mutations 
except a T to C transition at nt 7826, which resulted in a 
conservative change from a valine to an alanine residue. Three 
nucleotide changes were intentionally designed, two to create 
a Styl site (C to A and A to G at nt 8859 and 8862, respectively) 
and a third to knock out an £coRI site (A to G at nt 8880). 
These restriction sites were used as genetic markers to distin- 
guish the recombinant virus from the parental virus (see be- 
low). Other mutations in the cDNA clone may derive from the 
quasispecies of the original virus stock because the parental 
virus was not plaque purified. It is also possible that some of 
the mutations occurred during the cloning procedures. 

RNA transcript from WNV cDNA clone was highly infec- 
tious. Capped RNA transcript was synthesized from the Xbal- 
linearized full-length cDNA plasmid by using 17 RNA poly- 
merase and an optimized 4/3 ratio of methylated cap analogue 
to GTP. Approximately 30 to 40 u.g of RNA was generated 
from 2 |xg of DNA template in a 20-u.l reaction mixture. In- 
creasing the ratio of cap analogue to GTP substantially re- 
duced the RNA yield. We routinely electroporated 10 u,g of 
RNA transcript to 10 7 BHK-21 cells as described in Materials 
and Methods. The transfected cells were incubated and ob- 
served for CPE. Apparent CPE were observed in cells on day 
3 posttransfection. The cell culture medium was harvested, and 
virus titer was determined on Vero cells by plaque assays. High 
viral titers (1 x 10 9 to 5 X 10 9 PFU/ml) were consistently 
obtained. 

IFA were used to detect viral protein expression in BHK-21 
cells transfected with WNV RNA transcript (Fig. 3). No IFA 
staining was observed in cells 12 h posttransfection. At 24 h 
posttransfection, fluorescence was detected in the majority of 
cells. The staining intensity varied among the IFA-positive cell 
population. The fluorescent signal increased, and all cells were 



IFA-positive at 36 h posttransfection. To eliminate the possi- 
bility that the positive IFA was merely derived from translation 
of the transfected RNA rather than from RNA replication in 
cells, we transfected cells with a mutant RNA containing an 
expected lethal deletion of the 3'-terminal 199 nt of the 
genomic RNA. The 3' deletion RNA was synthesized from the 
cDNA plasmid digested with a WNV-unique Dral site (nt 
position 10830). No positive IFA staining was detected in cells 
at any time points posttransfection (data not shown). These 
results indicated that the positive IFA signals were initially 
derived from the replication of RNA in transfected cells and 
that progeny virus was subsequently generated and spread to 
neighboring cells through new rounds of infection. 

The specific infectivity of RNA transcribed from the full- 
length cDNA was determined in order to evaluate the effi- 
ciency of the system. The specific infectivity of RNA was esti- 
mated to be 5 X 10 4 to 1 X 10 5 PFU/u,g of RNA. Similar 
specific infectivity was obtained for a mutant RNA containing 
an extra four nucleotides (5'-CUAG-3') at the 3' end. The 
mutant RNA was synthesized from the Aftal-linearized DNA 
template without mung bean nuclease treatment. Genomic 
RNA purified from virus showed a specific infectivity of 5 X 
10 s to 1 X 10 6 PFU/ixg, approximately 10-fold higher than that 
of transcript RNA. Since uncapped RNA exhibits specific in- 
fectivity 10 2 - to 10 3 -fold lower than that of the capped tran- 
script (40, 54), the discrepancy of infectivity between viral and 
transcript RNA is most likely due to incomplete capping of the 
in vitro-transcribed RNA population or to sequence differ- 
ences, as outlined in Table 2. 

The stability of the clone was tested by propagating the 
E.coli HB101 hosting the full-length cDNA plasmid for six 
continuous passages. Restriction enzyme analysis of the plas- 
mid purified from each of these passages showed digestion 
patterns identical to that of the original pFLWNV (data not 
shown). BHK-21 cells transfected with RNA transcripts syn- 
thesized from the DNA of passage six showed specific infec- 
tivity and CPE indistinguishable from those of passage one 
cells, indicating that the full-length clone was stable. 

Recovered WNV derived from the cDNA clone retained ge- 



TABLE 2. Summary of sequence differences between the infectious 
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strain 3356 




Nucleotide 


Strain 3356 


cDNA / 
clone 


change 


Location 


1285 


T 


C 


Silent 


E 


3840 


T 


C 


Silent 


NS2A 


7015 


C 


T 


Silent 


NS4B 


7826 


T 


C 


V^A 


NS5 


8067 


G 


A 


Silent 


NS5 


8859* 


C 


A 


Silent 


NS5 


8862" 


A 


G 


Silent 


NS5 


8880" 


A 


G 


Silent 


NS5 


9123 


C 


T 


Silent 


NS5 


10613 


c 


T 


Silent 


3'UTR 


10783 


c 


T 


Silent 


3'UTR 



* Nucleotide position and sequence are based on WNV strain 3356 (GenBank 
accession no. AF404756). 

* Mutations were designed to generate an endonuclease Styl site as a marker 
for recombinant virus (Fig. 4). 

c This mutation was designed to knock out the endonuclease EcoRI site as a 
marker for recombinant virus (Fig. 4). 



Vol. 76, 2002 



INFECTIOUS cDNA CLONE OF THE EPIDEMIC WEST NILE VIRUS 5851 



DIC 



FITC 



12h 



24 h 




36 h 



FIG. 3. IFA of viral protein expression in cells transfected with full-length WNV RNA transcript. BHK-21 cells transfected with full-length 
WNV RNA transcript were analyzed by IFA at the indicated times posttransfection. Photomicrographs were taken at magnifications of X400. The 
left and right panels represent the same field of view for each time point. The left panels were visualized with differential interference contrast 
(DIC), and the right panels were visualized with a fluorescein isothiocyanate (FITC) filter set. For the IFA, WNV immune mouse ascites fluid and 
goat anti-mouse immunoglobulin G antibody conjugated with FITC were used as primary and secondary antibodies, respectively. 



netic markers. To exclude the possibility that the virus recov- 
ered from the transfected cells was due to contamination by 
the parental virus, we engineered genetic markers during the 
construction of the cDNA clones (Fig. 1). A Styl site was 
created and an Eco RI site was knocked out in the NS5 gene of 
the recombinant virus. A 388-bp fragment spanning the genetic 
markers from nt 8706 to 9093 was amplified through RT-PCR 
from RNA extracted from either parental or recombinant vi- 
rus. Digestion of the RT-PCR products with Styl and EcoRl 
revealed different cleavage patterns, depending on the origin 
of the RNA (Fig. 4). As expected, PCR products amplified 
from parental virus were cleaved by Eco RI to generate frag- 
ments of 173 bp and 215 bp (lane 4 in Fig. 4B), but were not 



digested by Styl (Fig. 4B, lane 3). In contrast, PCR products 
derived from the recombinant virus were not cleaved by EcoRl 
(Fig. 4B, lane 8) but were digested by Styl to generate frag- 
ments of 152 and 236 bp (Fig. 4B, lane 7). As a negative 
control, cells were transfected with RNA containing a deletion 
of the 3'-terminal 199 nt of the WNV genome (Fig. 4B, 3' 
dltn). These cells did not yield any RT-PCR product (lane 9). 
These results clearly show that virus recovered from the trans- 
fected cells was derived from the infectious full-length RNA 
transcript, not from contaminating parental virus. 

The phenotypes of recombinant and parental WNV were 
indistinguishable. There was no difference in plaque size or 
morphology on Vero cells between the recombinant and pa- 
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FIG. 4. Recombinant WNV retains the genetic markers engineered during cDNA construction. A Styl site was created and an EcoRI site was 
knocked out in the NS5 gene of the recombinant virus to serve as genetic markers to distinguish recombinant virus from parental virus. A 388-bp 
fragment (from nt 8706 to 9093) spanning the Styl or £coRI site was amplified through RT-PCR from RNA extracted from either recombinant 
virus or parental virus. The RT-PCR fragments were subjected to Styl and EcoRI digestion. The 388-bp fragment derived from recombinant virus 
should be cleaved by Styl but not by £coRI; the RT-PCR fragment amplified from parental viral RNA should be digested by £coRI but not Styl. 
(A) Schematic drawing of genetic marker analysis. The expected sizes of the digestion products are indicated. (B) Agarose gel analysis of genetic 
markers. Expected digestion pattern as depicted in panel A was observed. As a negative control, no RT-PCR products were detected from the 
extracted supernatant collected from cells 5 days after transfection with a mutant RNA containing a deletion of the 3'-terminal 199 nt of the 
genome (lane 9, 3' dltn). A 100-bp ladder was loaded on lanes 1, 5, and 10 as a standard. 



rental viruses (Fig. 5). One-step growth curves at a high MOI 
of 5 were similar for both recombinant and parental viruses on 
BHK-21 and C6/36 cells, and their growth characteristics at a 
low MOI of 0.05 were also equivalent on both cell types (Fig. 
6). Furthermore, no quantitative or qualitative differences in 
CPE were observed between the viruses at each MOI. These 
data suggest that the parental and recombinant viruses are 



indistinguishable in replication and spread in both mammalian 
and insect cells. 

The virulence phenotypes of the parental and recombinant 
viruses were compared in adult mice by inoculating 10 2 PFU 
s.c. The morbidity, mortality, and average survival times are 
reported in Table 3. The mortality after s.c. inoculation of 10 2 
PFU of parental and recombinant viruses was 62 and 50%, 
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FIG. 5. Plaque morphology of parental and clone-derived WNV on Vero cells. Vero cells in six-well plates were infected with 100 PFU of 
parental WNV 3356 (A) or 100 PFU of WNV derived from pFLWNV (B). Plaques were visualized 3 days postinoculation by staining for 24 h with 
neutral red. 



respectively. There were no observable differences in the se- 
verity or quality of the clinical signs, and the survival curves 
were very similar. Furthermore, there were no statistical dif- 
ferences in the morbidity (P = 0.25), mortality (P - 0.61), or 
average survival times (P = 0.58). Thus, the virulence pheno- 
types were indistinguishable for the parental and recombinant 
viruses. 

DISCUSSION 

We report the construction of the first full-length cDNA 
clone of the human epidemic strain of WNV (lineage I). RNA 
transcripts transcribed from the cDNA clone were highly in- 
fectious upon transfection into BHK-21 cells. The identifica- 
tion of genetic markers engineered into the clone confirmed 
that the progeny virus was derived from the cDNA clone and 
thus was not a contaminant. The infectivity of the cDNA- 
derived RNA was further supported by the finding that a mu- 
tant RNA with an expected lethal deletion of the 3'-terminal 
199 nt of the genome was not infectious. The recombinant 
virus showed biological properties indistinguishable from those 
of the parental virus, including plaque morphology, growth 
kinetics, and virulence characteristics. These results indicate 
that an efficient reverse genetic system has been established for 
lineage I WNV. 

A common difficulty in assembling full-length clones of fla- 
viviruses is that plasmids containing long fiavivirus-specific in- 
serts are unstable during propagation in bacteria. A number of 



TABLE 3. Morbidity and mortality of parental and 
recombinant WNV in adult mice" 



Inoculum 


Morbidity 
(no. sick/total) ( 


Mortality 
no. dead/total) 


Avg survival time 
(days [SD]) 


Diluent 


0/8 


0/8 


NA 


Parental WNV 


7/8 


5/8 


9.4(1.95) 


Recombinant WNV 


5/8 


4/8 


10.2 (2.50) 



" Mice were inoculated with 10 2 PFU s.c. in the left rear footpad. SD, standard 
deviation; NA, not applicable. 



approaches have been developed to assemble full-length g> 
cDNA clones of flaviviruses. (i) Full-length cDNA clone can be § 
assembled through in vitro ligation of cDNA fragments. This § 
approach avoids cloning and propagating full-length clones in g_ 
bacteria and has been successfully applied to generate infec- g - 
tious RNA of YF (54), JE (62), DEN2 (31), and TBE strain 3 
Hypr (40). (ii) Genome-length cDNA containing an upstream ~ 
promoter for transcription can be directly synthesized by a | 
one-step RT-PCR without any cloning. This rapid method was b 
successfully used to generate infectious RNA of TBE (19). *§. 
However, this approach has the limitation that the uncloned o 
PCR-derived cDNA will produce a heterogeneous RNA pop- ^ 
ulation, derived from mutations during RT-PCR or from qua- •< 
sispecies of the original virus stock, (iii) Full-length cDNA - <D 
clone can be assembled in yeast cells through homologous g 
recombination. This method was successfully used to assemble 
a full-length clone of DEN2 (49). (iv) Full-length cDNA can be 
cloned under the control of a eukaryotic promoter, and introns 
are introduced into the problematic regions of the cDNA to 
avoid mutations during their propagation in bacteria. This ap- 
proach requires transfection of eukaryotic cells with plasmid 
cDNA rather than RNA. An infectious JE clone was recently 
developed, using this approach, in which genomic RNA was 
made in situ by nuclear transcription and intron splicing in 
transfected eukaryotic cells (67). (v) Stable full-length cDNA 
clone can be constructed using low- or medium-copy-number 
vectors and selective bacterial hosts. This approach has been 
applied to a number of flaviviruses, including DEN4 (34), 
Kunjin virus (33), TBE strain Neudoerfl (40), MVE (27), TBE 
strain Langat (38), lineage II WNV (68), and lineage I WNV 
(this report). 

During the construction of our WNV clone, we found that 
the most unstable region of the genome was within its 5' 
quarter and that cDNA from this region should be assembled 
last in order to obtain full-length clones. These results are 
consistent with previous reports that cDNAs of structural re- 
gions are more likely to be unstable during cloning (67, 68). 
We also found that bacterial propagation at room temperature 
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Time post-inoculation (hr) 




FIG. 6. Comparison of the growth kinetics of recombinant and 
parental WNV. The growth of recombinant and parental viruses was 
compared at high and low MO Is on BHK-21 or Aedes albopictus C6/36 
cells. (A) Growth in BHK-21 cells. (B) Growth in C6/36 cells. Viruses 
were inoculated at an MOI of 5.0 (filled symbols) or an MOI of 0.05 
(open symbols) in triplicate in 12-well plates. Recombinant virus is 
designated by squares along a solid line. Parental virus is designated by 
triangles along a dashed line. Error bars represent ± standard devia- 
tion of triplicate wells. Dotted line indicates the limit of detection (500 
PFU/ml). 



rather than at 37°C yielded a higher success rate of cloning 
intact inserts. A recent report showed that cloning at room 
temperature was essential to the construction of infectious 
full-length DEN2 cDNA using high-copy-number plasmid vec- 
tors (61). 

The 3' UTR of flaviviruses is believed to function as a 
promoter for initiation of minus-strand RNA synthesis. The 
3'-terminal nucleotides of flavivirus genomic RNA were ther- 
modynamically predicted and experimentally demonstrated to 
form distinct secondary structures, including a short stem-loop 
(SL) adjacent to a long SL (8, 50, 52, 60). This secondary 
structure is conserved among divergent flaviviruses, although 
only short stretches of sequence are conserved. Structural anal- 
ysis of WNV 3' RNA reveals that the loop of the short SL 
interacts with the lower portion of the neighboring long SL to 



form a pseudoknot structure (60). Three host proteins bind 
specifically to the WNV 3' SL RNA (5), and one of these 
cellular proteins is the translation elongation factor, eFl-a (6). 
The NS5 (RNA-dependent RNA polymerase) and NS3 (pro- 
tease and helicase) of JE were shown to bind specifically to the 
3' SL RNA (17). The cyclization sequences in the 5'- and 
3'-terminal regions of the genome were recently demonstrated 
to be essential for flavivirus replication, both in vivo in Kunjin 
virus (32) and in vitro in DEN (1, 69). Furthermore, the 3' SL 
of WNV RNA was reported to suppress translation of mRNA 
(38). All of the above reports strongly suggest that the 3'- 
terminal region of the flavivirus genome plays an essential role 
in viral replication and possibly regulates translation. Our re- 
sults show that deletion of the 3'-terminal 199 nt of the ge- 
nome abolishes the infectivity of WNV RNA. These results 
agree well with previous reports that deletions in the 3' UTR 
of DEN4 (45) and many chimeric 3' UTRs between DEN2 and 
WNV (70) are lethal for viral replication. Since the 3'-terminal 
199-nt deletion in the mutant RNA included the 3' cyclization 
sequence (from nt 10925 to 10932) and the downstream 3'- 
terminal two-SL structure (from nt 10935 to 11029), it is im- 
portant to dissect their individual roles during viral replication 
through systematic mutagenesis analysis. 

Many different genetic determinants of virulence have been 
identified for the flaviviruses (43). For WNV, the studies have 
been limited. Chambers and coworkers (16) found that neu- 
roinvasion correlates with a mutation in the E gene and deter- 
minants outside the E gene. For the related viruses of the JE 
serocomplex, determinants of neuroinvasion and neuroviru- 
lence are in the E gene (12, 21, 22, 37, 44, 47, 65) and the NS1 
gene (20). For other flaviviruses, many of the putative viru- 
lence determinants are in the E gene (23-26, 29, 48, 58), but 
mutations in the NS1 gene (10, 46, 51), NS5 gene (66), and the 
5' and 3' UTR (10, 41) are also associated with virulence. 
Site-directed mutagenesis of the cDNA clone in this report will 
allow identification of molecular determinants of virulence for 
the epidemic strain of WNV. 

Lineage I WNV strains have been mostly isolated from ep- 
idemic outbreaks and epzootics in birds and equines and have 
a worldwide distribution. In contrast, lineage II strains have 
been incidentally isolated from humans with mild febrile dis- 
ease or without symptoms and are restrictedly found in sub- 
Saharan Africa and Madagascar (4, 35, 36, 59). Based on 
sequence analysis of the complete genomes, nucleotide iden- 
tity between the two lineages is approximately 75% (35). Lim- 
ited information is known about the pathogenic differences 
between lineages I and II and among strains within each lin- 
eage. Similar growth kinetics were observed for a lineage II 
Nigerian strain (68) and our lineage I New York strain for both 
mosquito and mammalian cells. Replacement of the 3'-termi- 
nal 1,438 nt of the Nigerian strain (lineage II) with the equiv- 
alent sequence (including the complete 3' UTR and sequence 
encoding the carboxy-terminal 287 amino acids of NS5) from 
the prototype WNV EglOl strain (lineage I) yielded a chimeric 
virus that showed growth kinetics similar to those of the wild- 
type Nigerian strain (68). Others have shown differences in 
neuroinvasiveness in mice between viruses from lineage I and 
II (D. W. C. Beasley, L. Li, M. T. Suderman, and A. D. Barrett, 
International Conference on the West Nile Virus, New York 
Academy of Science Poster Section 1:5, 2001). Lineage I 
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strains can be further divided into three clades: clade la in- 
cludes viruses from Africa, Europe, Russia, Middle East, and 
United States; clade lb includes Kunjin virus from Australia; 
and clade lc includes viruses from India (35). Within clade la, 
all U.S. isolates (including the New York crow strain 3356 used 
in this study) have a nucleotide identity of 99.8%, 99.7% with 
an Israeli 1998 strain, and 95.2 to 96.4% with strains from 
Europe, Russia, and Egypt. Kunjin virus in clade lb exhibits a 
nucleotide identity of 86.6 to 87.2% with strains in clade la (35, 
36). During the recent WNV outbreaks, bird mortality was 
observed in the United States and Israel but not in Europe; 
therefore, it was speculated that genetic variability within lin- 
eage I strains could affect pathogenicity (35, 36, 63). Lanciotti 
et al. (35) recently showed that six amino acid changes are 
consistent with the geographic origin of these viruses and 
might confer the pathogenic difference among these lineage I 
strains. It will be very interesting to use the infectious clone 
described here to experimentally test these observations. 

Many factors could contribute to the fact that lineage I 
WNV strains are frequently involved in human outbreaks, 
while lineage II viruses are mostly maintained in enzootic cy- 
cles (4, 36). In addition to possible differences in virulence, 
differences in vector competence and transmission cycles as 
well as host immunity may contribute to the difference in 
disease pattern between the two lineages. The infectious 
cDNA clones of WNV will serve as a valuable tool to address 
many of these questions. 
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Yellow fever virus (YF) is the prototype member of the Flavivirus genus. Here, we report the 
successful construction of a full-length infectious cDNA clone of the vaccine strain YF-1 7D. 
YF cDNA was cloned into a low-copy-number plasmid backbone and stably maintained in several 
E. coli strains. Transcribed RNAs had a specific infectivity of 10 5 -10 6 p.f.u. (ug RNA)" 1 , and the 
resulting virus exhibited growth kinetics, plaque morphology and proteolytic processing similar to 
the parental virus in cell culture. This clone was used to analyse the importance of conserved 
flavivirus RNA sequences and the 3' stem-loop structure in virus replication. The conserved 
sequences 5'CS and CS1 , as well as the 3' stem-loop structure, were found to be essential for 
virus replication in cell culture, whereas the conserved sequence CS2 and the region containing 
YF-specific repeated sequences were dispensable. This infectious clone will aid future studies on 

Received 26 September 2002 YF replication and pathogenesis, as well as facilitate the development of YF-1 7D-based 
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INTRODUCTION 

Yellow fever virus (YF) is the type member of the genus 
Flavivirus, a group of arthropod-borne RNA viruses in the 
family Flaviviridae (Lindenbach & Rice, 2001). The virus is 
endemic in the central regions of Africa and South America, 
where it exists primarily as a zoonosis among monkeys. 
Occasionally, YF is transmitted to human populations, 
resulting in epidemics with mortality rates of up to 60 %. 
Effective protection against infection is possible by vacci- 
nation with the attenuated YF-17D strain (reviewed in 
Burke & Monath, 2001). 

The YF genome is a positive-stranded RNA molecule of 
11-8 kb, with a 5' cap structure and a non-polyadenylated 3' 
terminus (Rice et al, 1985). The RNA encodes a single open 
reading frame (ORF) flanked by 5' and 3' non-translated 
regions (NTRs), which are 118 and 565 bases in length, 
respectively. Translation of YF RNA results in the produc- 
tion of a precursor protein that is cleaved by host and 
viral proteases to produce the mature viral proteins (see 



tPresent address: Institute for Animal Science and Health (ID-DLO), 
8200AB Lelystad, The Netherlands. 

tPresent address: Center for the Study of Hepatitis C, Laboratory of 
Virology and Infectious Disease, The Rockefeller University, New York, USA. 



Lindenbach & Rice, 2001, for a review). The N-terminal 
one-third of this polyprotein encompasses the structural 
proteins (C-prM-E). Proteolytic processing of the remain- 
der of the polyprotein yields the viral non-structural (NS) 
proteins (NS1-NS2A-NS2B-NS3-NS4A-NS4B-NS5). Repli- 
cation of the viral genome occurs in the cytoplasm and 
is associated with cellular membranes (reviewed by 
Lindenbach & Rice, 2001). 

All the mosquito-borne flaviviruses share conserved RNA 
sequences and structures (Fig. 1A). Sequence comparison 
and RNA secondary structure predictions of the 3'-NTR 
have revealed several short, well-conserved sequences and 
indicated that the 3 '-terminal region (approximately 90 
bases) can be folded in a conserved stem-loop structure (3'- 
SS) "(Brinton et al, 1986; Hahn et al, 1987; Proutski et al, 
1997a; Wengler & Castle, 1986). Apart from the sequence 
5'-CACAG-3' in the bulge at the top and A-U and G-C base 
pairs at the very bottom, this stem-loop structure is not well 
conserved in primary sequence. A short conserved sequence 
(CS1; ~26 nucleotides) has been identified upstream of 
3'-SS. Complementarity between CS1 and a conserved 
sequence at the 5' end of the YF ORF (5'-CS) has been 
proposed to result in a long-range intramolecular RNA 
interaction (Hahn et al, 1987). Recent experiments suggest 
that base-pairing between these sequences is essential 
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Fig. 1. (A) Schematic representation of the YF genome organiza- 
tion. The large open boxes representing the ORF encoding the 
viral structural and non-structural proteins is indicated. Cross- 
hatched box, the conserved sequence (CS) in the capsid gene; 
black boxes, the repeated sequences; hatched box, CS2; striped 
box, CS1. The stem-loop structure at the 3' end is depicted as a 
hairpin. The conserved structure and sequences are enlarged 
relative to the size of the ORF. (B) Schematic representation of 
the pACNR/FLYF-1 7Dx plasmid. The hatched part is the pACNR 
vector. The DNA fragments taken from pHYF5'3'IV are indicated 
by the open boxes and the DNA fragment cloned from pYFM5.2 
is indicated by the stippled area. The location of the SP6 RNA 
polymerase promoter and the restriction enzyme sites that were 
used to construct this full-length YF cDNA are indicated. Xho\ 
was used to linearize the plasmid for in vitro RNA transcription. 



for RNA replication of a Kunjin virus (KUN) replicon 
(Khromykh et al, 2001). 

The 3'-NTR contains another conserved sequence, CS2, 
which is approximately 24 nucleotides in length. In YF, CS2 
is located 22 nucleotides upstream of CS1. CS2 is dupli- 
cated in members of the dengue virus (DEN) and Japanese 
encephalitis virus (JE) subgroups. In addition to the RNA 
sequences and structures that are conserved among mosquito- 
borne flaviviruses, YF has an unique domain consisting of 
three stretches of closely spaced repeated sequences (RS) 
that are located just downstream of the YF ORF (Hahn 
et al, 1987). 

Based on computer-assisted RNA folding, phylogenetic 
sequence comparisons and biochemical and biophysical 
probing, several models for the RNA structure of the 3' end 
of the genome of the mosquito-borne flaviviruses have been 
proposed (Blackwell & Brinton, 1997; Brinton et al, 1986; 
Hahn et al, 1987; Olsthoorn & Bol, 2001; Proutski et al, 
1999; Shi era/., 1996)). Although these studies do not yield a 



consensus model for the flavivirus 3'-NTR structure, it is 
evident that the folding of the 3'-NTR is complex and 
involves many stem-loop structures and some potential 
RNA pseudoknots. DEN mutants with deletions in the 
3'-UTR have been described (Men et al, 1996), but their 
analysis does not favour any particular current RNA 
structure model. 

Attempts to construct a stable, full-length infectious YF 
cDNA in E. coli plasmid and X phage vectors have been 
unsuccessful due problems with the genetic stability of the 
full-length clone in the prokaryotic host (Rice et al, 1989). 
This problem was circumvented by using two plasmids and 
an in vitro ligation approach to create a full-length YF cDNA 
that could be used for the in vitro transcription of infectious 
YF RNA (Rice et al, 1989). Although cumbersome, this 
approach yielded the first functional flavivirus cDNA for 
in vitro transcription of infectious YF RNA. 

In this study, we describe the construction and characteri- 
zation of full-length YF cDNA in a low-copy-number vector 
that is stable in several different bacterial strains. The 
in vifro-transcribed RNA from this clone was shown to be 
highly infectious. The infectious clone was used to analyse 
the requirement for the conserved flavivirus RNA elements 
in YF replication. 

METHODS 

Cell cultures, virus stocks and plaque assays. BHK-21/J 
(Lindenbach & Rice, 1997) and SW13 cells were grown in Dulbecco's 
modified Eagle's medium (DMEM; Life Technologies) supplemented 
with 7-5% foetal calf serum (FCS). Virus stocks were obtained by 
harvesting the medium of BHK-21/J cells that were transfected with 
in vifro-transcribed RNA of wild-type or mutant YF-17D cDNA. The 
medium was harvested at 36 h post-transfection, clarified by centri- 
fugation (2500 g for 10 min) and stored at -80 °C. Virus stocks 
were titrated by plaque assays on SW13 and BHK21/J ceUs as 
described previously (Rice et al, 1989), except that DMEM instead of 
Eagle's minimal essential medium (MEM) was used in the overlays. 

Plasmid constructions. Standard nucleic acid methodologies were 
used (Ausubel et al, 2000; Sambrook et al, 1989). The E. coli strain 
MCI 061 was used for routine cloning purposes, whereas electro- 
competent E. coli Sure cells (Stratagene) were used as a host for the 
construction of full-length YF cDNAs. The cDNA fragments for the 
construction of the full-length YF cDNA were taken from the plas- 
mids pHYF5'3'lV and pYFM5.2. Plasmid pYFM5.2 has been descri- 
bed (Rice et al, 1989). pHYF5'3'IV is a phagemid and contains, in 
addition to the YF sequences of pYF5'3'IV (Rice et al, 1989), the fl 
origin for filamentous phage replication. Plasmid pACNR1181 was 
created from the low-copy-number vector pACNR1180, which con- 
tains the polylinker cassette of pSL1180 (Pharmacia) (Ruggli et al, 
1996). After digestion of pACNR1180 with Aatll and NotI, a short 
spacer sequence was inserted that resulted in the destruction of 
the Aatll site and deleted all the restriction enzyme sites between 
NoA and Sail in the polylinker cassette. The resulting pACNR1181 
was used as a vector to assemble a full-length YF-17D cDNA. 
pHYF5'3'IV was digested with NotI and Xhol. A 5-1 kb DNA frag- 
ment that contained the SP6 RNA polymerase promoter directly 
fused to the YF 5' end, the YF 5' 2271 bp, a small spacer element 
and the YF 3' 2586 nucleotides was isolated and cloned 
in NoflAXM-digested pACNR1181. The resulting plasmid, 
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pACNR1181YF5'3'IV, was digested with NsiL and AafU and ligated 
to the 6747 bp NsH-Aatll fragment from pYFM5.2 encompassing the 
middle part of the YF genome (nt 1655-8402). This resulted in the 
construction of pACNR/FLYF- 17Dx (Fig. IB), which contained a 
full-length YF-17D cDNA. 

Deletion mutagenesis of conserved RNA sequences and 
structures. All the deletion mutants were initially created in 
pHYF5'3'IV. The partial deletion of the YF 5'-CS (Table 1) was con- 
structed by fusion PCR (Landt et al, 1990). All the other deletion 
mutants were created using uridylated single-stranded pHYF5'3'IV 
DNA as a template to introduce additional restriction enzyme sites 
(Kunkel, 1985) flanking the conserved RNA sequence and structural 
elements in the viral 3'-NTR. HmdIII sites were inserted immediately 
5' and 3' of the RS, CS1 and CS2 sequences. The created plasmids 
were digested with HmdIII and religated to create the ARS, ACS1- 
CS2, ACS1 and ACS2 mutants (Table 1). Using the same uridylated 
template and strategy, an additional Xhol site was introduced at nt 
10776. This plasmid was cut with Xhol and religated to yield the ASS 
mutant (Table 1). The relevant parts of these pHYF5'3'IV derivatives 
were verified by DNA sequencing and then cloned into the full- 
length YF cDNA. 

In vitro transcription. Plasmids containing the full-length YF 
cDNA were linearized with Xhol and purified by phenol/chloroform 
extraction and ethanol precipitation. Run-off RNA transcripts were 
synthesized in vitro using SP6 RNA polymerase (Rice et al, 1987). 
Trace amounts of [ 3 H]UTP were included in the reaction to quanti- 
tate the yield. Transcripts were used for transfection without any 
additional purification. 

RNA transfections. A transfection plaque assay (Grakoui et al, 
1989) on SW13 cells was used to determine the infectivity of YF-17D 
transcripts in p.f.u. (ug RNA) -1 . In short, almost confluent mono- 
layers of SW13 cells in 35 mm diameter dishes were washed twice 
with PBS lacking Ca 2+ and Mg 2 + . A mixture of 01-1-0 ng wt 
or mutant YF-17D RNA transcripts and 4 ug lipofectin (Life 
Technologies) in 200 ul PBS was added to the cells. After 15 min, the 
transfection mixture was removed, the cells were washed once with 
PBS and a DMEM/agarose overlay was applied as described for the 
plaque assays. Plaques were identified by crystal-violet staining after 
incubation for 4 days at 37 °C (Rice et al., 1989). 

For direct analysis of experiments in which viral RNA synthesis and 
protein expression was analysed directly in the transfected cells, BHK- 
21 J cells were electroporated with 5 ug of in vifro-transcribed YF RNA 



as described previously (Lindenbach & Rice, 1997). Aliquots were 
taken from the medium of the transfected cells to quantify the 

Immunofluorescence. Transfected cells were grown on coverslips. 
At 24 h post-transfection or infection, the cells were fixed with 3 % 
paraformaldehyde in PBS (pH 7-4) for at least 30 min and washed 
with PBS containing 10 mM glycine. Following permeabilization 
with 0-1% Triton X-100 in PBS, the cells were incubated in PBS 
containing 2 % horse serum for 1 h to minimize non-specific immuno- 
fluorescence. Indirect immunofluorescence was carried out with a 
1 : 1000 dilution of mAb 1A5 (provided by J.J. Schlesinger) in PBS, 
which is specific for the YF NS1 protein (Schlesinger et al., 1983) 
and visualized with a secondary Cy3-conjugated rabbit anti-mouse 
IgG (Jackson ImmunoResearch Laboratories) diluted 1 : 1000. 

Labelling and analysis of viral RNAs. In general, 3 ml of the 
electroporated BHK-21J cell suspension (approximately 1-5 xlO 6 
cells) was placed in 35 mm plates in DMEM containing 2 % FCS. At 
the indicated times post-electroporation (p.e.), the medium was 
replaced with 750 ul medium containing 2 ug actinomycin D ml -1 
and 50 uCi [ 3 H]uridine ml" 1 . At 24 h p.e., RNA was isolated with 
Trizol (Life Technologies) and resuspended in 21 ul H 2 0. One-third 
of the RNA was denatured with glyoxal and DMSO and analysed by 
electrophoresis in 0-8% MOPS/agarose gels (Sambrook et al, 1989). 
Gels were prepared for fluorography as described previously 
(Bredenbeek et al, 1993). 



RESULTS 



Construction and characterization of a stable, 
full-length yellow fever cDNA clone 

The low-copy-number vector pACNR1181, derived from 
pACNRl 180 (Ruggli et al, 1996), was used as a vector for the 
construction of a full-length YF-17D cDNA (see Methods; 
Fig. IB). Transformation of E. coli Sure cells with the 
ligation reaction that was expected to yield the full-length YF 
cDNA construct resulted in the production of small and 
large bacterial colonies after incubation of the plates for 24 h 
at 37 °C. Both types of colonies were grown in liquid medium 
and plasmids were isolated. Based on restriction enzyme 
digestion patterns all the small colonies («= 14) carried the 
plasmid containing the full-length YF cDNA (pACNR/ 
FLYF-17Dx) whereas the bacteria of the faster-growing 



Table 1. Characteristics of YF mutants containing mutations in conserved nucleotide sequences and/or conserved secondary 
RNA structures 



Construct 


Position of deleted nucleotides in YF 


No. of deleted nucleotides 


RNA synthesis* 


Plaque formationt 


YF-17DA5'-CS 


155-160 


6 




No 


YF-17DARS 


10333-10520 


188 ( + aagcuu)t 


+ + 


Yes 


YF-17DACS1-CS2 


10705-10772 


68 (-t-aagcuu)^ 




No 


YF-17DACS2 


10705-10729 


25 ( + aagcuu)t 


+ 


Yes 


YF-17DACS1 


10748-10772 


25 ( + aagcuu)^ 




No 


YF-17DASS 


10776-10861 


86 




No 


YF-17Dx 






+ + 


Yes 



*As determined by [ 3 H]uridine labelling of electroporated BHK-21J cells. 
tAs determinted by lipofection of SW13 cells. 

:fDue to engineering an extra HmdIII site, six extra nucleotides were inserted. 
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colonies (n = 4) carried only the parental vector 
pACNR1181YF5'3'IV (data not shown). 

Xfcol-linearized p ACNR/FLYF- 1 7Dx template was used for 
in vitro RNA transcription and the resulting full-length YF 
transcripts were electroporated into BHK-21J cells. The trans- 
fected cells were fixed at 24 h p.e. and analysed for the 
expression of YF NS1 by immunofluorescence microscopy. 
Many of the electroporated cells showed a perinuclear, 
punctated signal when stained with an NS1 -specific anti- 
body (Fig. 2A). As a control, cells were transfected with a 
truncated YF-17Dx transcript lacking the 3' terminal 155 
nucleotides of the YF genome (Xbal site in Fig. IB) and 
therefore unlikely to be replication-competent. The cells 
transfected with the truncated YF transcripts failed to 
express any detectable NS1 protein (Fig. 2B). These results 
demonstrated that YF RNA transcribed from pACNR/FLYF- 
17Dxwas replication-competent and that the NS1 signal was 
not merely the result of translation of the input RNA. 

Stability of the infectious YF cDNA in E. coli 

The initial in vitro ligation approach to obtain infectious YF- 
17D RNA was developed because of severe stability problems 
with the full-length YF-17D cDNA in E. coli. Therefore it was 
of great importance to examine the stability of pACNR/ 
FLYF-17Dx in E. coli. The recombination-deficient E. coli 
Sure and DH5a strains and recombination-competent E. coli 
MC1061 strain were transformed with p ACNR/FLYF- 1 7Dx. 
Two colonies from each transformation were propagated for 
ten cycles consisting of alternating growth in liquid medium 
and plating on a selective medium. No significant change in 
the growth characteristics or colony morphology of the 
bacteria was observed during these passages. Restriction 
enzyme digestion of plasmid DNA purified after 5 and 10 
cycles generated the expected DNA pattern (data not shown). 
YF-17Dx RNA transcribed from these purified plasmids was 
used to determine the specific infectivity of the YF-17D 
transcripts. As shown in Table 2, the specific infectivity of the 
RNA transcripts was comparable [3-5-4-5 x 10 5 plaques (jag 
RNA) ~ 1 ], irrespective of the E. coli host that had been used to 
grow the plasmid. More importandy, no significant changes 



(A) (B) 




Fig. 2. Immunofluorescence staining of BHK-21 cells with 
the YF NS1 -specific antibody, 1A5, at 24 h p.e. (A) Cells 
transfected with full-length YF-17Dx transcripts. (B) Cells 
transfected with YF transcripts lacking the 3' 155 nucleotides 
of the YF sequence. 



Table 2. Specific infectivity of plasmid-derived YF-17Dx 
RNA after repeated passaging of pANCR-FLYF-1 7Dx in 
E. coli 



E. coli strain 


Specific infectivity [YF plaqu 


•s (ng RNA)" 1 ]* 


Passage 1 Passage 5 


Passage 10 




2-8 xlO 5 3-8 xlO 5 


3-5 x 10 5 


DH5a 


ND 3-7 xlO 5 


3-5 x 10 5 


MC1061 


ND 4-3 x 10 5 


4-5 x 10 5 



*For each E. coli strain, the specific infectivity is the average of the 
transfection of in vitro-transcribed RNA from two individually 
passaged bacteria cultures. 
ND, Not determined. 



in the number of plaques (|!g RNA) -1 were detected 
after repeated passages of the plasmid in any of the studied 
E. coli strains. 

Kinetics of viral RNA synthesis and virus 
production 

To study the kinetics of YF RNA synthesis and virus 
production directly in transfected cells, BHK-21J were 
electroporated with YF-17Dx transcripts and labelled with 
[ 3 H]uridine in the presence of actinomycin D for 6 h at 6 h 
intervals. RNA replication was first detected after labelling 
the cells from 12 to 18 h p.e. and synthesis peaked between 
18 and 30 h (Fig. 3A). After 30 h, viral RNA synthesis 
decreased, which correlated with the increasing YF-induced 
cytopathic effect on the transfected cells. Virus could be 
detected in the medium of the electroporated cells at 12 h 
p.e. (Fig. 3B). There was a significant rise in virus titre 
between 12 and 18 h p.e., which levelled off between 24 and 
30 h (Fig. 3B) and remained at this level for the remaining 
period of the experiment. 

Next, the growth properties of the parental YF-17D virus 
and the YF-17Dx virus released into the medium of the 
transfected cells were compared. BHK-21 J cells were infected 
at an m.o.i. of 10 with a laboratory strain of YF-17D and 
YF-17Dx virus. Virus was harvested at 12 h intervals and the 
yield was quantified as described previously (Rice et al, 
1989). Similar results were obtained for both virus strains, 
with virus titres in the medium peaking at 36 h and 
being slightly lower at later time points (Fig. 4). In addi- 
tion, the expression of YF proteins was analysed by radio- 
immunoprecipitation of YF proteins in lysates of [ 35 S] 
methionine-labelled BHK-21 J cells infected with either YF- 
17D or YF-17Dx virus. No apparent difference in migration 
or relative levels of the prM, E, NS1, NS2B, NS3, NS4B or 
NS5 proteins was observed (data not shown). 

Conserved flavivirus RNA elements that are 
essential for virus replication 

Several mutant YF- 1 7D cDNAs were constructed by deleting 
either the conserved flavivirus sequence elements (A5'-CS, 



1264 



Journal of General Virology 84 



Characterization of an infectious YF cDNA 



(A) Time post-electroporation (h) 
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Fig. 3. Kinetics of YF RNA synthesis and virus production in 
BHK-21J cells electroporated with YF-17Dx transcripts. (A) 
Kinetics of YF RNA synthesis. The transfected cells were labelled 
for 6 h with 50 uCi [ 3 H]uridine ml" 1 in the presence of actinomy- 
cin D and lysed at the indicated times p.e. (B) Production of YF in 
electroporated BHK-21J cells. Samples (n = 2 per time point) of 
the medium of the transfected cells were taken at the indicated 
times p.e. and used for a plaque assay on BHK-21 J cells to deter- 
mine the YF titre. 



9 




2 J 1 1 1 1 1 

0 12 24 36 48 60 
Time post-infection (h) 



Fig. 4. Comparison of the growth characteristics of YF-17Dx 
virus derived from the infectious clone (♦) with YF-17D virus 
stocks derived from vaccine lots (□). BHK-21J cells were 
infected at an m.o.i. of 10. The medium of the infected cells was 
sampled at the indicated times post-infection. Plaque assays on 
BHK-21 J cells were used to determine the virus titre. 



ACS1, ACS2 and ACS1-CS2), RNA structures (ASS), or 
the subgroup-specific repeated sequences (ARS) (Fig. 1A). 
In vitro RNA transcripts were transfected into SW13 and 
BHK cells by lipofection and electropo ration, respectively. 
Only the ARS and ACS2 mutants were able to form plaques 
after lipofection into SW13 cells at either 31 or 37 °C. 
Plaques were not observed for the A5'-CS, ACS1-CS2, ACS1 
and ASS mutants. The plaques that were formed by the ARS 
mutant were somewhat smaller than the plaques obtained 
with YF-17Dx (Fig. 5A). The plaque morphology of the 
ACS2 mutant varied between experiments. Turbid plaques 
as well as more clear plaques were observed. The size of 
the ACS2 mutant plaques was significantly smaller than 
observed for YF-17Dx plaques (Fig. 5A). 

First cycle RNA analysis was performed to exclude the 
possibility that the lack of plaque formation for 5'-CS, 
ACS1-CS2, ACS1 and ASS mutants was due to the deletion 
of viral sequences that are involved in virion assembly, e.g. 
the encapsidation signal. BHK-21 J cells were electroporated 
with in vitro transcripts of either wt YF-17Dx or the deletion 
mutants and labelled with [ 3 H]uridine from 18 to 24 h p.e. 
in the presence of actinomycin D. Total RNA was isolated 
from the transfected cells and analysed on a denaturing 
agarose gel. As shown in Fig. 5(B), synthesis of YF RNA 
could only be detected in the cells transfected with the ARS 
and ACS2 mutants. No viral RNA synthesis was detected 
for the A5'-CS, ACS1-CS2, ACS1 and ASS mutants. 




WT ARS ACS2 




Fig. 5. (A) Plaque morphology of YF-17Dx, YF-17DARS and 
YF-1 7DACS2 after transfection plaque assays on SW13 cells. 
The transfected cells were incubated at 37 °C, 5% CO z for 
96 h, fixed with 8% formaldehyde and stained with crystal 
violet. (B) Analysis YF RNA synthesis in BHK-21J cells electro- 
porated with in w'fro-transcribed RNA of the indicated YF 
mutants. Transfected cells were labelled with [ 3 H]uridine in the 
presence of actinomycin D between 1 8 and 24 h p.e. 
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To compare the growth characteristics of the mutants and 
the parental viruses BHK-21J cells were infected at an m.o.i. 
of 10 with either the ARS or ACS2 mutants or with wt YF- 
1 7Dx virus. Virus was harvested at 8 h intervals and the yield 
was determined by plaque assay (Fig. 6). All three viruses 
showed a rapid increase in titre between 8 and 24 h, with 
virus production peaking at around 32 h and than level- 
ling off. The kinetics of ARS and ACS2 virus production 
appeared a little slower than wt virus and the maximum titre 
was also somewhat lower. Both the ARS and ACS2 mutants 
showed a clear cytopathic effect. 



DISCUSSION 

Recombinant cDNA clones from which full-length infec- 
tious RNA can be transcribed are a valuable tool for studying 
the molecular biology of positive-strand RNA viruses (Boyer 
& Haenni, 1994). The approach relies on the infectious 
nature of the genome RNA of these viruses when transfected 
into permissive host cells. However, the construction of such 
functional cDNAs for flaviviruses has proven difficult 
(Ruggli & Rice, 1999, for a review). Often the plasmids 
containing a full-length cDNA of these viruses are unstable 
in E. coli. For YF, this problem was initially circumvented 
by using two plasmids and an in vitro ligation approach 
(Rice et al, 1989). This strategy yielded the first functional 
flavivirus cDNA for in vitro transcription of infectious viral 
RNA. In this report, the construction and characterization 
of a stable, full-length YF cDNA in an E. coli plasmid vector 
has been described. The YF cDNA fragments that were used 
to assemble this clone were taken from the plasmids 
pYF5'3TV and pYFM5.2, which were previously used to 
create the full-length cDNA template for the production of 
infectious YF RNA by in vitro ligation (Rice ef al, 1989). The 




0 8 16 24 32 40 48 



Time post-infection (h) 



Fig. 6. Growth characteristics YF-1 7DARS (♦) and YF- 
1 7DACS2 (A) virus compared with YF-1 7Dx (■). BHK-21J cells 
were infected with YF-1 7Dx, YF-1 7DARS and YF-1 7DACS2 virus 
at an m.o.i. of 10. Samples of the media were collected at the 
indicated times post infection. The virus titre was determined by 
plaque assay on BHK-21J cells. The growth curves are based on 
the average virus titre in two samples per time point per mutant. 



low-copy-number plasmid pACNR1180 was used as a 
vector for the construction of the full-length YF cDNA. This 
vector is derived from pACYC177 and has previously been 
used to construct stable, infectious pestivirus cDNAs 
(Mendez et al, 1998; Ruggli et al, 1996). The RNA trans- 
cribed from p ACNR/FLYF- 1 7Dx had a specific infectivity in 
the order of 10 5 — 10 6 p.f.u. (ng RNA) -1 as determined by 
lipofection. This was significantly higher than reported for 
the YF RNA obtained from in v?'fro-ligated templates, even 
when these values are corrected for the percentage of full- 
length YF RNA in the RNA transcribed from the ligated 
template (Rice et al, 1989). Similar specific infectivities have 
only been reported for a few other flavivirus clones, such as 
tick-borne encephalitis virus (Mandl et al, 1997) and 
Murray Valley encephalitis virus (Hurrelbrink et al, 1999). 
The progeny virus obtained from the p ACNR/FLYF- 1 7Dx 
clone showed similar kinetics in proteolytic processing, 
viral RNA synthesis and growth as the parental virus in 
cell culture. 

The stability of p ACNR/FLYF- 1 7Dx was extensively tested 
in the E. coli strains Sure, DH5a and MC1061. No changes in 
E. coli colony morphology or growth characteristics were 
observed during the passaging of pACNR/FLYF- 1 7Dx in 
these bacteria. More importantly, RNA transcribed from 
plasmid DNA isolated at passages 5 and 10 showed a similar 
specific infectivity as RNA derived from the originally iso- 
lated plasmids. These results demonstrate the successful 
construction of a stable full-length YF cDNA in a plasmid 
vector that can be used for in vitro transcription of highly 
infectious viral RNA. Apart from the mutagenesis studies 
that are described in this report, the p ACNR/FLYF- 1 7Dx 
clone has already been used successfully in other studies 
(Amberg & Rice, 1999; Kummerer & Rice, 2002; Lindenbach 
& Rice, 1999). 

As for all RNA viruses, the 5'- and 3'-NTRs of the YF 
genome are believed to play a crucial role in the initiation of 
viral RNA synthesis. Both the YF 5'- and 3'-NTRs contain 
sequence motifs and/or stem-loop structures (Fig. 1) that 
are well conserved among flaviviruses. It has been suggested 
that these domains are essential for viral RNA synthesis. In 
this study, the role of the conserved flavivirus sequences (5'- 
CS, RS, CS2 and CS1) in virus replication was analysed, 
as well as the stem-loop structure (SS) at the 3' end of the 
viral genome. 

Deleting the RS domain was well tolerated by YF and 
resulted in a virus with similar biological properties to the 
YF-17Dx virus. A domain comparable with the YF RS 
sequences has also been found in the JE virus serogroup, but 
there is no sequence similarity in the RS domains of YF and 
IE-like viruses. For KUN, a deletion of 352 nucleotides in the 
5'-proximal half of the 3'-NTR resulted in a five- to tenfold 
reduction in replication (Khromykh & Westaway, 1997). 
This deletion was upstream of CS2 and included the RS 
sequences. RS is lacking in DEN. Deletions in this region of 
DEN type 4 yielded viruses with delayed growth properties 
and a smaller plaque size (Men et al, 1996). These deletions 



1266 



Journal of General Virology 84 



Characterization of an infectious YF cDNA 



always included the DEN CS2B element and are therefore 
difficult to compare with the YF-17DARS mutant, which 
retained CS2. 

RNA secondary structure analysis of flaviviruses predicts 
that the YF CS2 sequence forms an independent stem- 
loop structure within the 3'-NTR (Olsthoorn & Bol, 2001; 
Proutski etal, 1997b). The fact that this stem-loop structure 
is well conserved in both pathogenic and vaccine strains 
of YF suggests that this sequence is essential for virus 
replication. However, analysis of YF-17DACS2 showed that 
this sequence can be deleted with relatively minor effects 
on virus replication. Compared with YF-17Dx, the rate of 
RNA synthesis by YF-17DACS2 was somewhat decreased. 
However, the kinetics of virus production of YFACS2 was 
similar to YF-17Dx. The reason for the observed variability in 
YF-17DACS2 plaque size and morphology is unclear. Small 
and turbid plaques have also been reported for DEN type 4 
deletion mutants involving CS2 (Men etal, 1996). RT-PCR 
and sequence analysis showed that the deletion was still 
present in YF-17DACS2 progeny virus; however, second-site 
revertants cannot be excluded. 

All the mosquito-borne flaviviruses contain a 5' conserved 
sequence that is located a few nucleotides downstream of the 
translation initiation codon. This 5'-CS is actually part of 
the flavivirus coding sequence. It has been suggested that the 
flavivirus 5'-CS sequence base pairs with CS1 via a long- 
range RNA interaction (Hahn et al, 1987; Khromykh 
et al, 2001). This interaction is predicted to result in a 'pan- 
handle' -like structure that is hypothesized to be required 
for virus replication by modulating virus translation 
(Khromykh et al, 2001). Recently it was shown, using a 
KUN replicon and a DEN NS5-based in vitro polymerase 
assay, that complementarity between 5'-CS and CS1 is a 
prerequisite for viral RNA synthesis (Khromykh et al, 2001; 
You et al, 2001). The observation that the partial deletion of 
YF CS1 is lethal for viral RNA synthesis, as reported in 
this study, is in agreement with a model that requires 
'circularization' at some stage of the flavivirus replication 
cycle. The involvement of CS1 in the 'cyclization' of the viral 
genomic RNA does not exclude the possibility that the CS1 
sequence might also take part in an alternative structure 
involving base-pairing to other domains within the 3'-NTR. 
(Proutski et al, 1997a, b; Rauscher et al, 1997; Shi et al, 
1996). It can be hypothesized that these different RNA 
structures involving CS1 are metastable and in equilibrium 
with each other. This equilibrium may be influenced by 
factors like RNA-protein interactions and the cellular 
environment, thereby regulating negative-strand RNA syn- 
thesis versus positive-strand RNA synthesis or availability 
of the RNA for translation. 

Another important element of the 3'-NTR RNA is formed 
by the 3 '-terminal 86 nucleotides that are deleted in the YF- 
17DASS mutant. In all the models describing the flavivirus 
3'-NTR RNA, these nucleotides are involved in the forma- 
tion of a hairpin structure (Hahn et al, 1987; Mackenzie 
et al, 2001; Proutski et al, 1997b; Shi et al, 1996). Deletion 



of the 3' 86 nucleotides was lethal for YF RNA synthesis as 
shown by the results with the YF-17DASS mutant. 

Finally, as one of the safest and most effective human 
vaccines, YF-17D recombinants are being vigorously 
explored as candidate vaccines for other flavivirus diseases, 
such as JE (Monath, 2002), DEN (Der Most et al, 2000) and 
West Nile virus (Monath, 2001), as well as for cancer 
vaccines (McAllister et al, 2000). The availability of full- 
length stable YF cDNA clones should help in these efforts. 
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